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Abstract 

Many existing medium access control (MAC) protocols utilize past information (e.g., the results 
of transmission attempts) to adjust the transmission parameters of users. This paper provides a general 
framework to express and evaluate distributed MAC protocols utilizing a finite length of memory for 
a given form of feedback information. We define protocols with memory in the context of a slotted 
random access network with saturated arrivals. We introduce two performance metrics, throughput and 
average delay, and formulate the problem of finding an optimal protocol. We first show that a TDMA 
outcome, which is the best outcome in the considered scenario, can be obtained after a transient period 
by a protocol with {N — l)-slot memory, where is the total number of users. Next, we analyze the 
performance of protocols with 1-slot memory using a Markov chain and numerical methods. Protocols 
with 1-slot memory can achieve throughput arbitrarily close to 1 (i.e., 100% channel utilization) at the 
expense of large average delay, by correlating successful users in two consecutive slots. Finally, we apply 
our framework to wireless local area networks. 

Index Terms 

Access control, access protocols, communication systems, distributed decision-making, multiaccess 
communication. 

I. Introduction 

In multiaccess communication systems, multiple users share a communication channel and contend for 
access. Medium access control (MAC) protocols are used to coordinate access and resolve contention 
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among users. We can categorize MAC protocoli^ into two classes, centralized and distributed protocols, 
depending on the existence of a central entity that coordinates the transmissions of users. Time division 
multiple access (TDMA) is an example of centralized protocols, where a scheduler assigns time slots 
to users. Centralized control can achieve a high level of channel utilization by avoiding collisions, but 
it requires large overhead for the communication of control messages. Slotted Aloha and IEEE 802.11 
distributed coordination function (DCF) are examples of distributed protocols. In slotted Aloha, users 
transmit new packets in the next time slot while retransmitting backlogged packets with a fixed probability. 
In DCF, carrier sense multiple access with collision avoidance (CSMA/CA) and binary slotted exponential 
backoff (EB) are used for users to determine their transmission times. These distributed protocols can be 
implemented without explicit control messages, but coordination is limited in that collisions may occur 
or the channel may be unused when some users have packets to send. 

In this paper, we aim to improve the degree of coordination attainable with distributed protocols by 
introducing memory into the MAC layer. Under a protocol with memory, a node dynamically adjusts its 
transmission parameters based on the history of its local information. The idea of utilizing histories at 
the MAC layer can be found in various existing protocols. For example, the slotted Aloha protocol IHJ 
and its generalized version |21 adjust the transmission probabilities of nodes depending on whether the 
current packet is new or backlogged. The pseudo-Bayesian algorithm of [3] utihzes channel feedback 
to update the estimated number of backlogged packets in the system, based on which the transmission 
probability is determined. The EB protocols in [4] use the results of transmission attempts to adjust the 
contention window and current window sizes of nodes or their transmission probabilities. 

The above protocols, however, utilize available past information in a limited way. We can consider 
the above protocols as the current state of a node determining its transmission parameters and the state 
transition occurring based on its local observations. Although this structure makes implementation simple 
in that nodes can simply keep track of their states in order to make transmission decisions, there may be 
many possible paths that lead to the same state, and important information may be lost by aggregating 
different histories into a single state. For example, in slotted Aloha, a node with a backlogged packet 
uses the same transmission probability following a slot in which it waited and following a slot in which 
it transmitted and collided. However, these two outcomes are observable by the node, and a significant 
performance improvement may be achieved by using different transmission probabilities following the 



'since we deal with MAC protocols exclusively in this paper, we use the term "protocol" to represent "MAC protocol" 
hereafter. 
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two outcomes. Another limitation of the above protocols is that they are designed assuming a particular 
form of feedback information. In case that more informative feedback is available, utilizing the additional 
information may result in performance gains. For example, the EB protocols in H prescribe that a node 
should not update its parameters following a waiting slot. If a node can sense the channel while waiting, 
utilizing the information obtained from sensing may improve the performance of the EB protocols. 

In order to overcome the limitations of the existing protocols utilizing memory, we provide a systematic 
framework to express and evaluate protocols with memory in the context of a slotted multiaccess 
system with saturated arrivals where nodes make transmission decisions based on their transmission 
probabilities. Our framework allows us to formally express a protocol utilizing memory of any finite 
length and operating under any form of feedback information. Also, we introduce two performance 
metrics, throughput and average delay, based on which we can evaluate protocols with memory. The two 
main results of this paper can be summarized as follows. 

1) In the considered scenario with saturated arrivals, TDMA is the best protocol in a sense that there is 
no other protocol that achieves a higher throughput or a smaller average delay. A TDMA outcome 
can be obtained after a transient period by a protocol with {N — l)-slot or A^-slot memory, where 

is the total number of nodes in the system. 

2) A protocol with 1-slot memory can achieve throughput arbitrarily close to 1 (i.e., 100% channel 
utihzation) at the expense of large average delay, by correlating successful nodes in two consecutive 
slots (i.e., a node that has a successful transmission in the current slot has a high probability of 
success in the next slot). 

The proposed protocols with memory can be related to splitting algorithms |5| and reservation Aloha 
|[6l . In splitting algorithms such as tree algorithms [7j|, backlogged nodes are divided into groups, one 
of which transmits in the next slot. Protocols with memory use histories to split nodes into groups. As 
nodes randomly access the channel based on transmission probabilities, histories wiU evolve differently 
across nodes as time passes. The probability of a successful transmission can be made high by choosing 
transmission probabilities in a way that the expected size of the transmitting group is approximately one 
most of the time. In reservation Aloha, nodes maintain frames with a certain number of slots, and a 
successful transmission serves as a reservation for the same slot in the next frame. Reservation Aloha 
can thus be expressed as a protocol with memory whose length is equal to the number of slots in a 
frame, provided that all nodes can learn successful transmissions in the system. Protocols with memory 
are more flexible than reservation Aloha in that protocols with memory can specify different transmission 
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probabilities in non-reserved slots, can make reservations in a probabilistic way, and can be implemented 
with an arbitrary form of feedback information. 

The rest of this paper is organized as follows. In Section II, we describe the considered slotted 
multiaccess model and define protocols with memory. In Section III, we introduce performance metrics 
and formulate the problem of finding an optimal protocol. In Section IV, we show that a protocol with 
{N — l)-slot or A^-slot memory can achieve the performance of TDMA. In Section V, we analyze 
the properties of protocols with 1-slot memory using numerical methods. In Section VI, we show that 
protocols with memory can be applied to the wireless local area network (WLAN) environment and can 
achieve a performance improvement over DCF. We conclude the paper in Section VII. 

II. System Model 

A. Setup 

We consider a slotted multiaccess system as in |2|. The system has total N contending users, or 
transmitter nodes, and the set of users is denoted hy J\f = {1, . . . , N}. We assume that the number of 
users is fixed over time and known to users. Users share a communication channel through which they 
transmit packets. Time is slotted, and users are synchronized in their slot transmission times. We label 
slots by t = 1, 2, .... A user always has a packet to transmit and can attempt to transmit one packet in 
each slot. The set of actions available to a user in a slot is denoted hy A = {T, W}, where T stands 
for "transmit" and W for "wait." We denote the action of user i hy Ui £ A and an action profile, or a 
transmission outcome, by a = (oi, . . . , oat). The set of (transmission) outcomes is denoted by ^ = A^ . 
A transmission is successful if it is the only transmission in the slot, and two or more transmissions in 
the same slot result in a collision. 

B. Feedback Information 

After a user transmits a packet, it learns whether the packet is successfully transmitted or not using an 
acknowledgement (ACK) response. If the user receives an ACK from the receiver node, it learns that its 
transmission was successful. Otherwise, it concludes that its packet has collided. We assume that there is 
no error in the transmission and the reception of ACK responses so that a user always learns the correct 
results of its transmission attempts. Formally, we represent ACK feedback to user i by j4Ci^j(a), which 
takes the value yes if a = a*, where a* is the outcome in which only user i transmits, and no otherwise. 

At the end of each slot, users obtain channel feedback about the number of transmissions in that slot. 
Let A; (a) be the number of transmissions in outcome a. The set of possible numbers of transmissions in a 
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slot is /C = {0, 1, . . . , N}. Let Hi be an information partition of /C for user i HI. Then channel feedback 
to user I in a slot with outcome a is given by the element of Hi that contains /c(a), which we denote 
by hi{a.). The definition of information partitions requires that A; (a) S /ii(a) for all a € ^. That is, 
channel feedback never leads users to regard the actual number of transmissions as impossible. However, 
when channel feedback has errors, it is possible for users to obtain incorrect channel feedback such that 
k{a.) ^ hi{a). In the presence of the hidden terminal problem (also called "erasures" in |]i9|), users may 
interpret a success or a collision slot as an idle slot. Under noise errors 191, users may interpret an idle or 
a success slot as a collision slot. It is also possible that users obtain channel feedback in a probabilistic 
way, in which case /ii(a) is determined according to a probability distribution on Hi, A(i/j)(a). 

All the aforementioned channel feedback settings can be incorporated in the formulation of the protocol 
designer's problem developed in Section III. Although modeUng general settings can be of importance in 
practice, we impose the following simplifying assumptions on channel feedback for analytic convenience. 
First, we assume that channel feedback is generated in a deterministic way. Then channel feedback to 
user i can be represented by a mapping hi from A to Hi, instead of A{Hi). Second, we assume that 
A;(a) G /ii(a) for all a G ^ so that channel feedback always contains the actual number of transmissions. 
Finally, we assume that every user obtains the same channel feedback, i.e., hi = ■ ■ ■ = h^ = h, which 
requires Hi = ■ ■ ■ = Hn — H. Under these three assumptions, a channel feedback model is completely 
described by an information partition H of /C. 

Consider a partition of /C, {{0}, {!}, e}, where e = {2, . . . , N}. Each element of the partition represents 
a type of channel feedback, and users can potentially learn whether there has been zero packet (idle), one 
packet (success), or more than one packet (collision) transmitted in the slot. With an abuse of notation, 
we will use and 1 to represent channel feedback that corresponds to {0} and {1}, respectively |5]. We 
say that the channel possesses ternary feedback, or (0, 1, e) feedback, if all three types of feedback are 
available. Binary feedback is also possible, and given the three possible types of channel feedback, we 
can consider three kinds of binary feedback: success/failure (S/F) feedback, which informs users whether 
there was a successful transmission (1) or not (0 U e); collision/no collision (C/NC) feedback, which 
informs users whether there was a collision (e) or not (0 U 1); and empty/not empty (E/NE) feedback, 
which informs users whether the current slot was empty (0) or not (1 U e) [9^1- We can also consider no 
channel feedback, which does not give any channel information to users and corresponds to information 
partition {/C}, and [N + l)-ary feedback, which informs users of the exact number of transmissions 
and corresponds to information partition {{0}, {!}, . . . , {A^}}, the finest partition of /C. (See |,10il for a 
similar list of channel feedback models in a multiple reception scenario.) 
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The feedback information of user i consists of ACK feedback and channel feedback, i.e., {ACKi{aL), hi{a)). 
We define a feedback technology as a rule that generates feedback information for each user depending 
on transmission outcomes. The above three assumptions on channel feedback allow us to represent a 
feedback technology by a mapping p : S ^ 2^, where S = Uae.4{(ai> ^(a))}> and write the feedback 
information of user i more compactly as p{ai,k{aL)). Since ACKi{aL) = no if a, = W, we can set 
p{W, k{a)) = /i(a). Since user i can distinguish the outcomes in 1 from those in e using ACK feedback 
whenever it transmits, we can set p(T, A;(a)) = 1 if ACKi{a) = yes and h{a) n e if ACKi{a) = no. 
Let Zp = UaeA{p{o,i: k{a))}, which is independent of i by the symmetry assumption. Zp represents the 
set of feedback information that a user can obtain with feedback technology p. The feedback information 
of user i is denoted by Zi G Zp. We use TZ to denote the set of all feedback technologies, which is 
equivalent to the set of all partitions of /C. 

C. Protocols With Memory 

A user decides whether to transmit or not in each slot using a transmission probability, which lies in 
[0, 1]. A protocol is a rule based on which users determine their transmission probabilities. We assume 
that control or coordination messages cannot be used in the system. Then the transmission action and 
the feedback information of user i are all the information that it obtains in a slot. The M-slot history for 
user i in slot t is given by 

Tt_(^t-M t~M. . t-1 t^l^ 

for M = 1, 2, . . . and t = 1, 2, . . .j] We set (af, 4) = (W^> P{W, 0)) for t' < as initialization. Let Cp be 
the set of possible action-feedback pairs under feedback technology p, i.e., £p = Uae^{(aj, /o(ai, A;(a)))} C 
A X Zp. Then the set of M-slot histories is given by A stationary decision rule based on M-slot 
histories is defined by a mapping 

where f{L) represents the transmission probability for a user whose M-slot history is L € C^^ . 

Since the set of Af -slot histories C'^^ is affected by feedback technology p, the set of stationary decision 
rules based on M-slot histories depends on the feedback technology of the system. In particular, as the 
feedback technology is more informative in a sense that the corresponding information partition is finer, 

^In addition to actions and feedback, a user knows the transmission probabilities it has used. We do not include past transmission 
probabilities in histories because we focus on protocols that do not depend on past transmission probabilities directly. 
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users can distinguish more outcomes, and thus more decision rules can be deployed. We use Fm,p to 
denote the set of all stationary decision rules based on M -slot histories under feedback technology p. We 
define a protocol with M-slot memory as a profile of stationary decision rules based on Af -slot histories 
given a feedback technology p, i.e., f = (/i, . . . ,/iv) € p- ^^^^ ^ protocol f is symmetric 

if it prescribes the same decision rule to every user, i.e., /i = • • • = /at. We will sometimes use / to 
represent a symmetric protocol f with a common decision rule / when there is no confusion. 



D. Automaton Representations of Protocols With Memory 

A protocol with memory can be described by a finite automaton, which consists of a finite set of states, 
an initial state, an action rule, and a state transition rule lITTl . We can represent a symmetric protocol 
/ S J^M,p as a finite automaton by defining states as possible M-slot histories, the initial state as the 
M-slot history obtained from M idle slots, the action rule as the decision rule /, and the state transition 
rule as to specify the new state as the M-slot history updated based on the transmission action and the 
feedback information in the current slot. 

Fig. Qfa) shows the automaton representation of a protocol with memory in the simplest case of 1-slot 
memory and no channel feedback (i.e., ACK feedback only). With ACK feedback only, there are three 
possible action-feedback pairs, or 1-slot histories: (1^,0), (T, 1), and (T, e), where corresponds to 
channel feedback K,. Hence, there are three states in the automaton, corresponding to the three possible 
action-feedback pairs, with the initial state specified as {W, 0). The transmission probability in each state 
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{a, z) is given by f{a,z). The state in the next slot is determined by the action-feedback pair in the 
current slot. 

Ill proposes generalized slotted Aloha protocols, which we call two-state protocols. Under a two-state 
protocol, users choose their transmission probabilities depending on whether their current packets are 
new (a free state) or have collided before (a backlogged state). Thus, having ACK feedback suffices 
to implement a two-state protocol. The automaton representation of a two-state protocol is shown in 
Fig- mtb)! In general, a two-state protocol cannot be expressed as a protocol with finite memory. Since 
the state remains the same following a waiting slot, the transmission probability for the current slot is 
determined by the result of the most recent transmission attempt, which may have occurred arbitrarily 
many slots ago. However, 121 shows that under saturated arrivals two-state protocols achieve the maximum 
throughput when pp = 1 and pc — )■ 0, where pp and pc are transmission probabilities in the free and 
backlogged states, respectively. When a two-state protocol specifies pp = 1, which is also the case in the 
original slotted Aloha protocol, the action-feedback pair {W, 0) cannot occur in the free state and thus we 
can find an equivalent protocol with 1-slot memory such that /(T, 1) = 1 and f{W, 0) = /(T, e) = pc- 
In effect, a two-state protocol with pp = I aggregates the two different action-feedback pairs, (VF, 0) 
and (T, e), into one state, the backlogged state, and assigns the same transmission probability following 
these two action-feedback pairs. On the contrary, a protocol with 1-slot memory can fully utilize all 
the available information from the previous slot in that three different transmission probabilities can be 
assigned following the three action-feedback pairs. 

As will be shown later in Section V.D., the limited utilization of past information by two-state protocols 
results in performance limitations. In particular, throughput, or the fraction of success slots, is bounded 
from above by N/{2N — 1) under two-state protocols (Theorem 2 of |2]) while protocols with 1- 
slot memory can achieve throughput arbitrarily close to 1. When memory longer than 1 slot or channel 
feedback is available, there are more distinguishable histories for each user. Two-state protocols aggregate 
different histories into just two states whereas protocols with memory can assign as many transmission 
probabilities as the number of possible histories given the length of memory and the feedback technology. 
Thus, the performance gap between the two kinds of protocols will be larger when longer memory or 
more informative feedback is available. 
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III. Problem Formulation 

A. Performance Metrics 

1 ) Throughput: We define the throughput of a user as the fraction of slots in wliich it has a successful 
transmission and total throughput as the fraction of slots in which there is a successful transmission in 
the system, which is equal to the sum of the individual throughput of users. When users follow a protocol 
with memory f , throughput can be computed using a Markov chain. Let M be the length of memory 
used by protocol f and p be the associated feedback technology. Then we can consider a Markov chain 
whose state space is given by A^. We write an element of as a = (ai, . . . , slm)- Let -Li(a) be the 
M-slot history for user i when the outcomes in the recent M slots are a, i.e., 

Li(a) = (ai,i,p(ai,i,A;(ai));...;ai,M,p(aj,M,A;(aM))), 

where ^ is the ith element of a„i. Given Lj(a) as its M-slot history, user i transmits with probability 
fi{Li{aL)). The transmission probabilities of users yield a probability distribution on the outcome in the 
current slot. The probabihty that the current outcome is a' under f when the outcomes in the recent M 
slots are a is given by 

N 

P(a'|a;f) = [] [l{a:a.=T}(a')/^(^^(a)) + l{a:a.=iy}(a')(l - MHm] , (1) 

i=l 

where 1b '■ A ^ {0, 1} is an indicator function such that 1b{^) = 1 if a G B and otherwise. The 
transition probabihty from a G A^ to a' = (a'^, . . . , a^) G A^ under protocol f is given by 

P(a'jy^|a; f) if a^ = a„+i, for all m = 1, . . . , M - 1, 
otherwise. 

Let Vt(f) be the probability distribution on the state space A^ in slot t induced by protocol f. By 
the initiahzation in the definition of protocols with memory, the initial distribution Vo(f) has element 1 
for a = (a^, . . . , a°) and elsewhere, where aP denotes the idle outcome, {W, . . . , W). Let Q(f ) be the 
transition matrix of the Markov chain under protocol f . Then the probabihty distribution on A^ in slot 
t can be computed by 

vt(f) = vo(f)Q(f)*, 

for t = 1,2, . . .. Let 5* be the subset of A'^ with user i's success as the most recent outcome, i.e., 
<S* = {a G A^ : SLM = a*}. Then the probability of user i's success in slot t is given by ri,j(f) = 



Q(a'|a;f) 
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SaG5' '^<(^> f)- "^^^ fraction of slots with user i's success, or the throughput of user i, is given by 

1 

r,(f) = lim 7 VTi,t(f), 

J-5-CXD J ' 

t = l 

assuming that the Umit exists. If f is chosen so that the induced Markov chain has only one closed 
communicating class, then there exists a unique stationary distribution v(f), independent of the initial 
distribution vo(f), which satisfies 

v(f) = v(f)Q(f) and v(f)e = 1, (2) 

where e is the column vector of length lA'^^] whose elements are all 1 fTT]. Then the expression for 
the throughput of user i is reduced to ri(f ) = J2aeS' ^(^5 f )■ Finally, the total throughput of the system 
under protocol f is given by 

N 

r(f) = J]r.(f). 

i=l 

2) Average Delay: The average delay of a user is defined as the average waiting time, measured in 
the unit of slots, until the beginning of its next successful transmission starting from an arbitrarily chosen 
time. Average delay under a protocol with memory f can be computed using a Markov chain. Consider 
a slot to which an arbitrarily chosen time belongs, and let a G A^'^ be the outcomes in the recent M 
slots. Given a, f yields a probability distribution on the outcome in the next slot, P(a'|a;f), given by 
O. Using i-*(a'|a; f), we can compute the probability that user i succeeds for the first time after r slots 
when the outcomes in the recent M slots are given by a and users follow f , 

fj.i{r; a, f) = i-*r{a(_|_,. = a* and at+r' 7^ a*, for all r' = 1, . . . , r — l|a( = a; f}, (3) 

for r = 1, 2, . . .. For example, 

/Xi(l;a,f) = P{at+i = a*|at = a;f) and 

/Xi(2;a,f)= ^ P(at+2 = a*|at+i = (ai,a);f)P(at+i = a|at = a;f), 

a6^\{a'} 

where (at,a) G A^^ is obtained by deleting the first outcome in a^ and adding a as the most recent 
outcome. Using ([3]), we can compute the average number of slots until user i's next success starting from 
a slot with the recent M outcomes a: 

oo 

dj(a;f) = y~];Uj(r;a, f) r. 

r=l 
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Since the distribution on the recent M outcomes in slot t is given by Vt(f), the average delay of user i 
under protocol f can be computed as 



where 0.5 is subtracted to take into account that the average time staying in the initial slot is 0.5 starting 
from an arbitrarily chosen time. When the Markov chain has a unique stationary distribution v(f), the 
expression for the average delay of user i is reduced to 



3) Discussion on Throughput and Average Delay: The throughput of the system can be considered as 
an efficiency measure of a protocol as it gives the channel utilization over time. However, as its definition 
suggests, throughput reflects the performance averaged over a long period of time and does not contain 
much information about the short-term and medium-term performance. As an illustration, consider two se- 
quences of outcomes generated by repeating (a^ , , . . . , ) and (a^ , a^ , a^ , a^ , a^ , a^ , a^ , a^ , a^ , a^ , . . . , 
a^, a^, a^, a^, a^). In both sequences, each user succeeds in one out of N slots over a long period of 
time, as measured by throughput 1/N. However, users may prefer the first sequence to the second one 
as the first exhibits the steadier performance over a short period of time. For example, suppose that a 
user counts its successes in N consecutive slots from an arbitrary slot. The first sequence guarantees one 
success in N consecutive slots regardless of the initial slot. On the contrary, in the second sequence, N 
consecutive slots contain no success with a high probability and many successes with a low probability. 
The variation in the performance over a short period of time is captured by average delay. 

A widely-used measure of delay in queueing theory is the inter-packet interval. In our model, we can 
compute the average inter-packet time of user i, i.e., the average number of slots between two successes 
of user i, using 



We can interpret (ii(a;f) as the average time to enter one of the states in <S* starting from the state a 
under protocol f . Then -Di(f) can be interpreted as the average time to enter starting from an arbitrary 
state, where the starting state is chosen following the stationary distribution. Similarly, Di{f) can be 
interpreted as the average time to return <S* starting from a state in S^. 
Note that di(a;f) satisfies 




A(f)= J2 ^(a;fH(a;f)-0.5. 




(4) 




(5) 
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for all a G A^^ , where 5* is the complement of 5*, i.e., 5* = A^'^ \ 5*. Using ^ and ([Sjl, we can show 
that X]aG5' ^(^' f) = 1 foi" ^i^y protocol with memory f, and thus (|4]| can be rewritten as 



which can be regarded as a version of Little's Theorem ||5l. Therefore, the average inter-packet time is 
completely determined by throughput, and thus it provides no additional information beyond that provided 
by throughput. 

The waiting time paradox fT3\ suggests that not only the mean of the inter-packet time but also 
the variance of the inter-packet time matters for the average waiting time measured from an arbitrarily 
chosen time. Let Xi be the random variable that represents the inter-packet time of user i with support 
N = {1,2, . . .}. By definition, we have E{Xi) = Di. The PoUaczek-Khinchine (P-K) formula IHl gives 
the average residual time until the next success, or average delay, by 



where Ki represents the coefficient of variation of Xi, i.e., Kj = -s/VarlXi)/ E{Xi). The P-K formula 
shows that average delay is increasing in the variance of the inter-packet time for a given level of 
throughput (or equivalently, for a given mean of the inter-packet time). 

Consider the following three examples with a fixed level of the average inter-packet time Di and differ- 
ent levels of the coefficient of variation Kj. (i) Suppose that user i has periodic successful transmissions. 
Then it transmits a packet successfully once in Di slots, and its average delay is Di/2 since Ki = 0. Note 
that Di /2 is the smallest average delay achievable with protocols that yield the average inter-packet time 
Di. (ii) Suppose that successes for user i are completely random in a sense that Xi follows a geometric 
distribution. Such a random variable can be generated by a memoryless protocol that prescribes a fixed 
transmission probability for each user. In this case, k? = 1 — 1/Di, and thus Di = Di — 0.5. (iii) Suppose 
that the successes of user i are highly correlated over time. Then » 1, and thus Di ^ Di. In this 
case, user i has frequent successes for a short period of time but sometimes has to wait for a long period 
of time until its next success. 

It is reasonable to assume that users prefer to have a steady stream of transmissions as well as a high 
transmission rate. As the above examples illustrate, the coefficient of variation of the inter-packet time 
measures the volatility in successful transmissions over time. Since average delay reflects the coefficient of 
variation of the inter-packet time, we use it as a second performance metric to complement the long-term 
performance metric, throughput. 




(7) 



13 



B. Problem of the Protocol Designer 

We formulate the problem faced by the protocol designer as a two-stage procedure. First, the protocol 
designer chooses the length of memory, M slots, and the feedback technology, p. Next, the protocol 
designer chooses a protocol from the class of available protocols given the stage-one choice (M, p). For 
simplicity, we assume that the protocol designer considers only symmetric protocols. For a symmetric 
protocol /, we have ri(/) = • • • = T]\j{f) and Di{f) = • • • = Dj\[{f). Total throughput is given by 
r(/) = NTi{f) for any i G N, and we use D{f) to denote the average delay of each user. 

The problem of the protocol designer can be written formally as 



U represents the utility function of the protocol designer, defined on total throughput and average delay. 
We assume that U is increasing in total throughput and decreasing in average delay so that the protocol 
designer prefers a protocol that yields high total throughput and small average delay. C represents the 
cost function of the protocol designer, defined on the length of memory and the feedback technology. We 
assume that C is increasing as memory is longer and as the feedback technology is more informative. 
Note that from a practical point of view the cost of expanding memory is vanishingly small compared 
to the cost associated with the feedback technology, which implies that the system is more likely to 
be constrained by the available feedback technology rather than by the size of memory. Hence, it is 
more natural to interpret the cost associated with memory as the cost of implementing protocols with a 
certain length of memory. For instance, the protocol designer may prefer protocols with short memory 
to protocols with long memory because the former is easier to program and validate than the latter. We 
say that a protocol / is optimal given {M,p) if it attains maxjgj-j^j^ U{T(f),D{f)). We say that / is 
an optimal protocol for the protocol designer if / solves 

In order to analyze the performance of protocols with memory, we approach the protocol designer's 
problem from two different directions. In Section IV, we first find the feasible throughput-delay pair most 
preferred by the protocol designer. Then we show that the most preferred throughput-delay pair can be 
achieved by a protocol with memory. In Section V, we focus on the simplest class of protocols with 
memory, namely protocols with 1-slot memory, and investigate their properties using numerical methods. 




(8) 
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IV. Most Preferred Protocols 

Suppose that the protocol designer can choose any protocol at no cost. Then the protocol designer's 
problem becomes 



where T denotes the set of all symmetric protocols including those that cannot be expressed as protocols 
with memory. We say that / is a most preferred protocol if / solves Consider a fixed level of total 
throughput r, which implies individual throughput t/N hy the symmetry assumption on protocols. Note 
that the P-K formula ([7]) was obtained without imposing any structure on protocols, and it shows that for 
a given level of throughput, average delay is minimized when there is no variation in the inter-packet 
time of a user, i.e., when = 0. Hence, combining ^ and ([7]), we can express the minimum average 
delay given total throughput r as 



Since Dmin{T) is decreasing in r, setting total throughput at the maximum level 1 yields the minimum 
feasible average delay -Dmm = ^mm(l) = N/2. In other words, there is no protocol / € J-" that attains 
D{f) < N/2. Hence, if there exists a protocol / that achieves the maximum total throughput 1 and the 
minimum average delay N/2 at the same time, then / is a most preferred protocol. 

In order to obtain the most preferred throughput-delay pair (1, N/2), a protocol needs to provide each 
user with a successful transmission in every N slots. TDMA is a protocol that achieves such a sequence 
of outcomes. Since the labels of users are arbitrary, we can describe the TDMA protocol as having user 
i transmit in slot t if {t mod N) = {i mod N) and wait in all other slots. Then each user has one 
successful transmission in every N slots, and thus the TDMA protocol is a most preferred protocol. 
However, the TDMA protocol requires coordination messages to be sent to users in order to assign time 
slots, and thus it does not belong to the class of protocols with memory defined in Section II.C. The 
following theorem establishes the existence of a most preferred protocol in the class of protocols with 
memory. 

Theorem 1: Assume {N — l)-slot memory and success/ failure (S/F) binary feedback psF- Denote an 
[N — l)-slot history by L = (a^, z^; . . . ; a^~^ , z^~^), and let n{L) be the number of successes in L, 
i.e., n{L) = \{m : = l,m = 1, . . . , N — Define a protocol with memory / € J-'(jv-i),psp by 



inaxC/(r(/),D(/)) 



(9) 





if L contains (T, 1) 



otherwise. 
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Then r(/) = 1 and D{f) = N/2. 

Proof: Since f{L) = if L contains (T, 1), a user waits for {N — 1) slots following its success. 
Also, users that have no success in recent (A^ — 1) slots compete among themselves with the transmission 
probability equal to the reciprocal of the number of such users. Suppose that there is a success in each of 
the recent {N — 1) slots for the first time in the system. Since a user waits for (A^ — 1) slots following its 
success, the {N — 1) successes must be by different users. Then the only user that has not had a success 
in the recent {N — 1) slots transmits with probability 1 while other users wait in the current slot. From 
that point on, a cycle of length N containing one success of each user is repeated. Since the probability 
of having {N — 1) consecutive successes before slot t approaches 1 as t goes to infinity, / achieves a 
TDMA outcome after a transient period with probability 1 , and thus it is a most preferred protocol. ■ 

Theorem 1 shows that with {N — l)-slot memory and S/F feedback, users can determine their 
transmission slots in a distributed way without need of explicit coordination messages, thereby achieving 
the same outcome as TDMA after a transient period. By trial and error, users find their transmission 
slots in a self-organizing manner and stabilize to transmit once in every N slots. Note that (A^ — 1) is 
the minimum length of memory required to emulate TDMA using a protocol with memory. A cycle of 
successes by A^ users cannot be generated with memory shorter than (A^ — 1) slots. Also, S/F feedback 
is necessary to guarantee a success after (A^ — 1) consecutive successes. 

The expected duration of the transient period can be shortened by using A^-slot memory. Consider a 
protocol with A^-slot memory, / G J^n,psf' defined by 

1 if{a\z^) = {T,l), 

if (a\z^) = (W,l), 

f{L) = { (10) 

if 1 and (a™, z"") = (T, 1) for some m = 2,...,N, 

1/{N -n{L)) if 1 and (a™,^"') / (T,!) for all m = 2, . . . , A^, 
where n(-L) is now the number of successes in recent (A^ — 1) slots. The first and the third lines of (ITOl) 
state that once a user succeeds, it waits for (A^ — 1) slots before the next transmission. The second and 
the fourth lines state that a user with no success in recent A^ slots waits if the current slot is already 
"reserved" by some user that succeeded A^ slots ago and contends with other such users if no user 
succeeded A^ slots ago. Hence, under /, once a user succeeds in slot t, it is guaranteed to succeed in 
slots t + N, t + 2N , and so on, whereas under / in Theorem 1 a user that succeeded A^ slots ago has to 
compete with other users with no success in recent (A^ — 1) slots during the transient period. Again, S/F 
feedback is necessary to let users know whether the current slot is reserved or not. / can be considered as 
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a generalization of reservation Aloha with a frame of N slots, where users can adjust their transmission 
probabilities depending on the number of contending users for non-reserved slots. 

In practice, the number of users may vary over time as users join and leave the system. When ternary 
feedback is available, users can find the exact number of users in the system using the protocol / as 
long as the number of users does not change too frequently. The initial estimate on the number of users 
is set to be the maximum number of users that the system can allow. When users follow / based on 
their initial estimate, there will be empty slots in a cycle obtained after a transient period. Once a cycle 
is repeated, users can construct a new cycle by deleting the empty slots, adjusting the number of slots 
in a frame equal to the actual number of users. If some users leave the network, empty slots will appear 
in a frame, and the remaining users can reduce the length of a frame by deleting the empty slots. If 
some users join the network, we require them to transmit immediately. Then a collision will occur, and it 
serves as a signal to notify the existing users that there arrived a new user in the system. Once a collision 
occurs after a transient period, users reset their estimates to the maximum number of users and repeat 
the procedure from the beginning in order to increase the length of a frame by the number of new users. 

V. Protocols With 1-Slot Memory 

A. Structure of Delay-Efficient Protocols 

We now examine protocols with 1-slot memory, which are the simplest among protocols with memory. 
Under protocols with 1-slot memory, users determine their transmission probabilities using their action- 
feedback pairs in the previous slot. In Appendix A, we explain in detail how to compute throughput and 
average delay under symmetric protocols with 1-slot memory using a Markov chain. Since the protocol 
designer prefers a protocol with small average delay for a given level of throughput, we consider the 
following reduced problem of the protocol designer: 

D*{t) = min D{f) subject to r(/) = r, (11) 

for T G [0, 1] and some feedback technology p. We say that a protocol / is delay-efficient if it solves 
([n]) for T = t(/), i.e., if D{f) = D*{T{f)). Also, we call the set of points {(r, D*{t)) : t G [0, 1]} the 
delay-efficiency boundary of protocols with 1-slot memory. Since computing r(/) and D{f) for a given 
protocol / G J^i p involves solving matrix equations, it is in general difficult to solve ([TTI) analytically, 
and thus we rely on numerical methods. 

To obtain numerical results, we consider ternary feedback, denoted by p3, and five users, i.e., = 5. 
In order to guarantee the existence of a stationary distribution for transition matrix Q(/), we restrict 
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the range of / (i.e., the set of possible transmission probabiUties) to be [10 '^,1 — 10 instead of 
[0, 1]. We use the function ^/?ico?i of MATLAB to solve the restricted version of cbI] Fig. I3ta)| depicts 
the delay-efficient boundary of protocols with 1-slot memory, shown as a U-shaped curve. Since the 
optimization problem to find delay-efficient protocols is not necessarily convex, it is possible that the 
numerical results locate local minima instead of global minima. To validate the delay-efficiency of the 
protocols obtained by the finincon function, we generate 5,000 symmetric protocols with 1-slot memory 
where transmission probabilities are randomly chosen on [0, 1] and plot total throughput and average 



delay under those protocols as dotted points in Fig. I31a)| (in the figure, only 3,631 points with average 
delay less than 100 are shown). The results in Fig. ffa)! suggest that the numerically computed protocols 

:ientO Fig. 



are indeed (at least approximately) delay-efficient 



ig- plots the transmission probabilities under 



the delay-efficient protocols, denoted by /*, as r varies. It shows that the structure of /* changes around 

^We vary r from 0.99 to 0.01 with a step size of 0.01. We choose the initial protocol for fmincon as 
(/(W", 0), f{W, 1), f{W, e), /(T, 1), /(T, e)) = {1/N, 0, 1/(7V - 2), 1, 0) for r = 0.99 and use the solution for r = nx 0.01 
as the initial protocol for r = (n — 1) x 0.01 from n = 99 to n = 2. 

''Another interesting point to notice from Fig.|Ja)]is that most of the randomly chosen protocols yield throughput-delay pairs 
close to those achieved by memoryless protocols. It suggests that protocols with 1-slot memory need to be designed carefully 
in order to attain total throughput above the maximum level achievable with memoryless protocols. 
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T = 0.41, which is also the turning point of the delay-efficiency boundary. There is numerical instability 
for T between 0.41 and 0.48 in that the solution depends highly on the initial protocol. In fact, for r 
in that region we can find protocols that yield average delay slightly smaller than those in Fig. | 3ta)| by 
specifying different initial protocols. 

In the following, we provide an explanation for the shape of the delay-efficiency boundary by in- 
vestigating the structure of the delay-efficient protocols. In the low throughput region, (r < 0.21), the 
structure of /* is given by 

r(W,0) G [0,0.2], nW,l) = 1, nW,e) = 0, r(r,l) = 0, r(r,e) G [0.65,1]. 

Since f*{T, 1) = and f*{W, 1) = 1, a success lasts for only one slot and is followed by a collision. 
Since /*(r, e) is high, a collision is likely to be followed by another collision. Similarly, an idle slot is 
likely to be followed by another idle slot since f*{W, 0) is low. This means that total throughput is kept 
low by inducing many idle or collision slots between two successes. Hence, as total throughput increases 
in the low throughput region, the expected number of idle or collision slots between two successes is 
reduced, and as a result users transmit their packets more frequently. According to the P-K formula (O, 
average delay is determined by the mean and the coefficient of variation of the inter-packet time. On the 
left-hand side of the delay-efficiency boundary (r < 0.4), the average inter-packet time is reduced while 
the coefficient of variation of the inter-packet time remains about the same as r increases, resulting in 
the inverse relationship between throughput and average delay. 

In the high total throughput region (r > 0.82), the structure of /* is given by 

nw,o)^^, r(PF,i)«o, r{w,e)^j^, r(r,i) = i, r(r,e) = o. (12) 

In a slot following an idle slot, users transmit with probability close to 1/N. In a slot following a 
success, the successful user transmits with probability 1 while other users wait with high probability. 
The transmission probability of other users approaches as r becomes close to 1. In a slot following a 
collision, users that transmitted in the collision wait while other users transmit with probability close to 
1/{N — 2). Since a collision involving two transmissions is most likely among all kinds of collisions, 
setting f*(W,e) = 1/{N — 2) maximizes the probability of success given that colliding users wait. 
f*{T, 1) = 1 and f*{W, 1) is the key feature of protocols with 1-slot memory that achieve high 
total throughput. By correlating successful users in two consecutive slots, protocols with 1-slot memory 
can yield total throughput arbitrarily close to 1. However, higher throughput is achieved by allowing a 
successful user to use the channel for a longer period, which makes other users wait longer until they 
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Fig. 3: Illustration of an optimal protocol. 



transmit next time. On the right-hand side of the delay-efficiency boundary (r > 0.4), the coefficient of 
variation of the inter-packet time increases without bound as r approaches 1, resulting in the positive 
relationship between throughput and average delay. 

Since the protocol designer prefers a protocol that yields high total throughput for a given level of 
average delay, optimal protocols must lie on the right-hand side of the delay-efficiency boundary. As an 
illustrative example, suppose that the utility function of the protocol designer is given by 

U{t, D) = - max{200(l - r), 

Then an increase in total throughput by 0.1 has the same utility consequence as a decrease in average 
delay by 20 slots. In Fig. [3j dashed curves depict the indifference curves of the utility function, each of 
which represents the throughput-delay pairs that yield the same level of utiUty, while the arrow shows 
the increasing direction of the utility function. The throughput-delay pair that maximizes the utility of 
the protocol designer is (0.792,41.6), marked with an asterisk in Fig. [3] The protocol designer can 
determine the optimal protocol /" given {M,p) = {l,P3) by finding a protocol / G J^i,p3 that yields 
{T{f),D{f)) = (0.792,41.6), which is 

f°{W, 0) = 0.20, f°{W, 1) = 0.03, nW, e) = 0.34, /"(T, 1) = 0.99, /"(T, e) = 0. (13) 
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Total Throughput 

Fig. 4: Total throughput and average delay under delay-efficient protocols based on the estimated numbers 
of users when the actual number of users is ten. 

B. Robustness Properties of Delay-Efficient Protocols 

1 ) Unknown Number of Users: We relax the assumption that users know the exact number of users 
in the system. Instead, we assume that each user has an estimate on the number of users and executes 
the prescribed protocol based on the estimate^ We consider a scenario where there are ten users and 
ternary feedback is available. For simplicity, we assume that the objective of the protocol designer is to 
achieve total throughput 0.9 and that all users have the same estimate on the number of users, which will 
be the case when users use the same estimation method based on past channel feedback. Fig. |4] shows 
a portion of the delay-efficiency boundary with = 10 and plots the throughput-delay pairs when the 
ten users follow delay-efficient protocols that are computed to achieve total throughput 0.9 based on 
the estimated number of users, denoted by N, between 7 and 13. The results from Fig. |4] suggest the 
robustness of delay-efficient protocols with respect to variations in the number of users in a sense that 
as the estimated number of users is close to the actual one, the protocol designer obtains a performance 
close to the desired one. 

Based on the observations from Fig. IH we can consider the following procedure to dynamically adjust 
the estimates of users. Users update their estimates periodically by comparing the actual total throughput 

^In other words, the protocol designer specifies a protocol as a function of the numbers of users, not a fixed protocol. 
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since the last update with the desired total throughput^ Users increase (resp. decrease) their estimates 
by one if the actual total throughput is lower (resp. higher) than the desired total throughput by a certain 
threshold level. When designed carefully, this estimation procedure will make the estimated number of 
users converge to the actual number of users because the actual total throughput is lower than (resp. 
equal to, higher than) the desired total throughput if the estimated number of users is smaller than (resp. 
equal to, larger than) the actual number. This procedure can be regarded as an extension of the pseudo- 
Bayesian algorithm of ^ in which users adjust their estimates in every slot based on the channel feedback 
of the previous slot. In the proposed estimation procedure, memory is utilized not only to coordinate 
transmissions but also to estimate the number of users. 

2 ) Errors in Feedback Information: So far we have restricted our attention to deterministic feedback 
technologies. We relax this restriction and consider stochastic feedback technologies. As discussed in 
Section II.B., stochastic channel feedback for user i can be represented by a mapping hi from A to 
A{Hi). If the structure of channel feedback, {hi, . . . , /ijv), is known to the protocol designer, it can be 
modeled in the transition matrix Q(/) in Section III by extending the state space from A^^ to (C^)^^. 
Then the protocol designer can find an optimal protocol taking into account randomness in feedback 
information. 

Here we introduce random errors in channel feedback, which are not modeled by the protocol designer, 
and examine the performance of an optimal protocol in the presence of random errors. We assume that 
ternary feedback is available but subject to random errors. In particular, a user obtains the correct feedback 
signal with probability 1 — 2e and each of the two incorrect signals with probability e, for a small e > 0. 
For example, if there is no transmission in the system, then a user receives feedback with probability 
1 — 2e and each of feedback 1 and e with probability e. We assume that the feedback signals of users are 
independent. We continue to assume that ACK feedback is perfect, and thus transmitting users always 
learn the correct results of their transmission attempts, regardless of the realization of channel feedback. 

Table |I] shows the performance of the optimal protocol f° in ([T3] ) at the various levels of e when N = 5. 
To obtain the simulation results, we generate transmission decisions and feedback information for 100, 000 
slots, for each level of e. Table |I] suggests that delay-efficient protocols have a robustness (or continuity) 
property with respect to random errors in feedback information, since the obtained performance is close 
to the desired one when the error level is small. Note that an error occurring to a waiting user following a 
success induces the user to transmit with a higher probability (i.e., to transmit with probabiUty f°(W, 0) 



*Note that users can compute the actual total throughput using ternary feedback. 
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TABLE I: Total throughput and average delay with a stochastic feedback technology. 
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or f°{W, e) instead of f°{W, 1)), making consecutive successes last shorter. Thus, as the error level 
increases, both total throughput and average delay decrease. The obtained throughput-delay pairs remain 
close to the delay-efficiency boundary, suggesting that errors following an idle or a collision slot cause 
little performance degradation. 

C. Comparison of Channel Feedback 

We now analyze the impact of the different forms of channel feedback on the performance of protocols 
with 1-slot memory. As mentioned in Section II.C, the set of available protocols expands as the 

feedback technology p becomes more informative (in other words, a protocol / G Tm.p can always be 
replicated by another protocol /' € J^M,p' if p' is more informative than p). This relationship implies 
that a more informative feedback technology yields a lower delay-efficiency boundary for a given length 
of memory. We consider six feedback technologies with ACK feedback and different channel feedback 
models: no channel feedback, S/F binary feedback, C/NC binary feedback, E/NE binary feedback, ternary 
feedback, and {N + l)-ary feedback, as introduced in Section II.B. Fig. [5] depicts the delay-efficiency 
boundaries of protocols with 1-slot memory under the six feedback technologies. As expected, for a 
given level of throughput, the minimum average delay becomes smaller as we move from no feedback 
to binary feedback, to ternary feedback, and to {N + l)-ary feedback^ 

'in Fig. [5] average delay is smallest under C/NC binary feedback for r — 0.42 and 0.43, which is a result of numerical 
instability around the turning point of the delay-efficiency boundaries, as pointed out in Section V.A. 
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Fig. 5: Delay-efficiency boundaries under various channel feedback. 

Fig. |5] shows that in the operating region of the protocol designer (i.e., the right-hand side of the delay- 
efficiency boundaries), protocols with 1-slot memory perform worse under no channel feedback and C/NC 
binary feedback than under other considered channel feedback. In order to obtain high throughput with 
1-slot memory, we need a high correlation between successful users in two consecutive slots, which 
requires f{W, 1) and f{T, 1) « 1. However, under no channel feedback or C/NC binary feedback, a 
waiting user cannot distinguish between idle slots and success slots, leading to f{W,0) = f{W, 1) ^ 0. 
This makes idle slots last long once one occurs, resulting in large average delay. Under S/F binary 
feedback, a user is constrained to use f{W,0) = f{W,e). As can be seen in Fig. \ ^h)\ the values of 
f{W,0) and f{W,e) are not much different in delay-efficient protocols under ternary feedback. Thus, 
using a single probability f{W, U e) instead of two different probabilities, f{W, 0) and f{W, e), causes 
only minor performance degradation. Under E/NE binary feedback, a delay-efficient protocol with 1-slot 
memory that achieves high throughput has the following structure: 

fiW,0)^^, f{W,lUe) = 0, /(r,l)«l, /(r,e)«i (14) 

Following a collision, only colliding users transmit under ([141 ) whereas only noncoUiding users transmit 
under (fT2l ). Fig. |5] shows that the restriction that E/NE binary feedback imposes compared to ternary 
feedback has a small impact on performance. Fig. |5] also shows that the improvement in performance from 
having (A^ + l)-ary feedback over ternary feedback is only marginal. The ability of users to distinguish 
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Fig. 6: Total throughput and average delay under two-state protocols, memoryless protocols, protocols 
with 1-slot memory, and TDMA. 

the exact numbers of transmissions in collisions does not help much because collisions involving three 
or more transmissions rarely occur under ternary feedback in the high throughput region. 

D. Comparison of Protocols 

We consider N = 5 and compare the performance of four different kinds of protocols: protocols 
with 1-slot memory, two-state protocols in [2], memoryless protocols, and TDMA. In Fig. |6l the delay- 
efficiency boundary of protocols with 1-slot memory is shown assuming no channel feedback (i.e., ACK 
feedback only) because two-state protocols can be implemented without using channel feedback. 121 
proposes two-state protocols of the form pp = 1 and pfj = 1 — "'-{/l — 1/??, where r/ is called a 
short-term fairness parameter, which measures the average duration of consecutive successes. We vary 
l/rj from 0.01 to 1 with a step size of 0.01 to generate the throughput-delay pairs under two-state 
protocols plotted in Fig. |6] The result confirms that total throughput achievable with a two-state protocol 
is bounded from above by N/{2N — 1) = 5/9, whereas protocols with 1-slot memory can attain any 
level of total throughput between and 1. Moreover, in the range of total throughput achievable with 
a two-state protocol, delay-efficient protocols with 1-slot memory yield smaller average delay than two- 
state protocols. Since two-state protocols with pp = 1 can be considered as imposing a restriction of 
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f{W, 0) = /(T, e), these results suggest that there is a significant performance degradation by assigning 
the same transmission probability following the two action-feedback pairs, (1^,0) and (T, e). Moreover, 
by combining Fig. [6] with Fig. |5j we can see that the performance degradation from using a two-state 
protocol instead of a protocol with 1-slot memory is severer when channel feedback is available. 

A symmetric memoryless protocol can be represented by a single transmission probability, which is 
used regardless of past histories. The throughput and the average delay of user i under memoryless 
protocol / G [0, 1] are given by nif) = /(I - f)^-'^ and D{f) = !//(! - f)^~^ - 0.5, respectively. 
The optimal memoryless protocol is thus the transmission probability that maximizes Tj(/) and minimizes 
D{f) at the same time, which is given by /o = 1/N |15|. The throughput-delay pairs under memoryless 
protocols in Fig. [6] are obtained by varying / from to 1. The lower-right point of the throughput-delay 
curve corresponds to the throughput-delay pair under the optimal memoryless protocol, (t(/o), -D(/o)) = 
((1 - l/iV)^-\ iV/(l - l/iV)^-i - 0.5) = (0.41, 11.71). Protocols with 1-slot memory also outperform 
memoryless protocols in that protocols with 1-slot memory support a wider range of achievable total 
throughput than memoryless protocols do. In Section IV, we have seen that TDMA achieves the most 
preferred throughput-delay pair, {l,N/2) = (1,2.5). Protocols with 1-slot memory cannot achieve the 
most preferred throughput-delay pair because memory of length at least (A^ — 1) slots is necessary to 
obtain it. 

VI. Application to Wireless Local Area Networks 

In the idealized slotted multiaccess system considered so far, all packets are of equal size, the trans- 
mission of a packet takes the duration of one slot, and users receive immediate feedback information. 
We relax these assumptions to apply protocols with memory to WLANs. In particular, packet sizes may 
differ across packets, and we take into consideration propagation and detection delay as well as overhead 
such as a packet header and an ACK signal. We consider a WLAN model where users follow a random 
access scheme using transmission probabilities based on CSMA/CA. In the WLAN model, the duration 
of slots depends on the channel state (idle, success, or collision). Let uo, ai, and a2 be the duration of 
a slot when the channel state is idle, success, and collision, respectively. The expressions for ai and a2 
can be found in (14) and (17) of [16], depending on whether the RTS/CTS mechanism is disabled or 
not. Total throughput is expressed as 

PiElP] 
Poao + Piai + P20-2 

where E[P] is the average packet transmission duration and Pq, Pi, and P2 are the fractions of idle, 
success, and collision slots, respectively. In the idealized slotted model, we assume that the size of 
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Fig. 7: Total throughput and average delay under delay-efficient protocols with 1-slot memory and 
memoryless protocols in the WLAN model. 

each packet is equal to the slot duration and ignore overhead so that = ai = a2 = E[P], and 
thus the expression for total throughput in ([T5] ) is reduced to Pi, the fraction of success slots. [161 
shows that the IEEE 802.11 DCF protocol can be approximated by a protocol that prescribes a single 
transmission probability, i.e., a memoryless protocol. The transmission probability corresponding to DCF 
is determined as a function of the minimum and maximum contention window sizes by solving (7) and 
(9) of [tl6il simultaneously. In Appendix B, we derive the expressions for throughput and average delay 
under symmetric protocols with 1-slot memory and memoryless protocols in the WLAN model. 

Fig. |7] depicts the delay-efficiency boundaries of protocols with 1-slot memory under ternary feedback 
and E/NE binary feedback. It also plots the throughput-delay pairs under memoryless protocols as well 
as the throughput-delay pair achieved by the memoryless protocol corresponding to DCF. To obtain 
numerical results, we consider N = 5 and use parameters specified by IEEE 802.11a PHY mode-8 [T7|, 
which are tabulated in Table [III The values of E[P], ao, ai, and a2 are 341.33, 9, 419.56 and 400.48, 
respectively, in ^us. To find the transmission probability corresponding to DCF, we use the minimum 
and maximum contention window sizes CWmin = 16 and CWmax = 1024. An upper bound on total 
throughput can be obtained by setting Pi = 1, which yields f = E[P]/ai ^ 0.8136. We compute 
delay-efficient protocols with 1-slot memory for r between 0.01 and 0.81 with a step size of 0.01 using 
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TABLE II: IEEE 802.11a PHY mode-8 parameters. 
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16 /IS 
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34 /is 



the same numerical method as in Section V. 

Comparing Fig. |7] with Fig. [6l we can see that by utilizing a carrier sensing scheme, memoryless 
protocols resolve contention more efficiently in the WLAN model than in the slotted multiaccess model. 
Since idle slots are much shorter than collision slots in WLAN, idle slots have much smaller effects on 
total throughput and average delay than collision slots have. Thus, in WLAN, transmission probabilities 
can be set low to achieve a success without experiencing many collisions. Carrier sensing also induces 
the turning point of the delay-efficiency boundaries in WLAN to occur around r = 0.69, yielding the 
narrower and steeper right-hand side of the delay-efficiency boundaries. The main findings in Section V.D. 
that protocols with 1-slot memory can achieve smaller average delay for a given level of total throughput 
and a wider range of total throughput compared to memoryless protocols remain valid in the WLAN 
model. On the right-hand side of the delay-efficiency boundaries, i.e., for r > 0.7, the delay-efficient 
protocols with 1-slot memory under ternary feedback have the structure of f{W, 1) = f{W, e) = 0, and 
thus the performance is not affected by having E/NE binary feedback instead of ternary feedback. 

VII. Conclusion 

In this paper, we have investigated how memory can be utilized in MAC protocols to achieve coordi- 
nation without relying on explicit control messages. With (A^ — l)-slot memory, users can share the 
channel as in TDMA. With 1-slot memory, high throughput can be obtained by correlating successful 
users in two consecutive slots, which results in large average delay. Generalizing these results, with M- 
slot memory, where M < N — 2, we can have the first M successful users use the channel alternatingly 
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while a collision created by a non-successful user with a small probability leads to a potential change of 
hands for the collision slot. 

Our framework can be extended in several directions. First, we can consider asymmetric protocols 

to provide quality of service differentiation across users. Second, as the literature on repeated games 
suggests, memory can also be used to sustain cooperation among selfish users. By utilizing memory, 
users can monitor the behavior of other users and punish misbehavior. Lastly, the basic idea of this paper 
can be carried over to a general multi-agent scenario where it is desirable to have one agent behave in 
a different way than others. 

Appendix A 

Derivation of Throughput and Average Delay Under Symmetric Protocols With 
1-Slot Memory in the Slotted Multiaccess System 

Exploiting the symmetry of protocols, we can consider a Markov chain for a representative user whose 
state is defined as a pair of its transmission action and the number of transmissions, instead of a Markov 
chain with the state space A as in Section III. Let user i be the representative user. The state of user i 
in the current slot when the outcome of the previous slot was a is given by s = (oj, A;(a)). We use <S to 
denote the set of all states, as defined in Section n.B. There are total 2N states, which we Ust as 

iT,l),...,iT,N),iW,0),...,{W,N-l). (16) 

Suppose that users follow a symmetric protocol / G J^i,p- Then we can express the transition probabiUties 
across states in terms of /. If user i is in state (T, k), then k users including user i transmit with probability 
f{T, p(T, k)), and {N-k) users with probability f{W, p{W, k)). Hence, a transition from (T, k) to {a', k') 
occurs with probabiUty 

Q{{T,k')\{T,k)-f) = f{T,p{T,k)) J2 

{{x,y)£nl:x+y=k'-l,x<k-l,y<N-k} 

^ ~ ^) /(T, p(r, k)ni - /(T, p{T, k)))'^-'-^ (^^ ~ ^) f{w, p{w, k))y{i - f{w, piw, k))f-''-y 

for k' = 1,...,N, and 

Q{{W,k')\{T,kyj) = (1 - f{T,p{T,k))) Yl (18) 

{{x,y)eNl:x+y=k' , x<k-l,y<N-k} 

~ ^) /(T, p{T, k)ni - f{T, p{T, k)))''-'-- (^^ ~ ^) f{W, piW, (1 - f{W, p{W, k))f-''-y 
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for A;' = 0, . . . , — 1, where No is the set of nonnegative integers. Similarly, if user i is in state (W, k), 
then k users transmit with probability /(T, p{T, k)), and {N — k) users including user i with probability 
f{W,p{W, k)). A transition from (VF, k) to (a', fc') occurs with probability 

Q{{T,k')\{W,k)-f) = f{W,p{WM (19) 

{(x',j/)GN^ ■.x+y=k'-l, x<k, y<N-k-l} 

^) /(T, p{T, k)ril - f{T, p{T, k))f-^ ~^ " ^) fiW, p{W, k)ni - f{W, piW, k))f-^-^'y 
for k' = 1,...,N, and 

Q{{W, k')\{W, k); /) = (!- /(W^, p{W, k))) (20) 

{(a;,j/)eN^ :x+y=k', x<k, y<N-k-l} 

^) /(T, piT, k)rii - /(T, /,(r, k)))>^-- (^^ "^^ " ^) fiw, PiW, k)ni - fiw, PiW, k))f-'^-'-y 

for k' = 0,...,N -1. 

Using (fT7])-(l20l). we obtain a (2iV x 2N) matrix Q(/). The (j, j')-entry of Q(/) is the transition 
probability from state j to state j', where the states are numbered in the order listed in ([T6l ). If / is 
chosen so that /(a,p(a,A;)) G (0,1) for all (a, A;) € S, then the Markov chain is irreducible and there 
exists a unique stationary distribution v(/) on S, represented by a row vector of length 2N, that satisfies 

v(/) = v(/)Q(/) and v(/)e2^ = 1, 

where e2N is a column vector of length 2N whose elements are all 1 |[T2l . Let s* = (T, 1) be the state 
of a successful transmission. Then the throughput of user i is given by the first element of v(/), i.e., 

Tiif) = vis*;f). 

By symmetry, total throughput is given by r(/) = Nriif). 
Using ([5]), we obtain the relationship 

diis;f) = l+ Y Q{s'\s;f)diis';f) (21) 

s'€S\{s'} 

for all s G S. Let dj(/) be the column vector consisting of di(s; /) and Qo(/) be a matrix obtained by 
replacing all the elements in the first column of Q(/) by 0. Then (|2T]) can be expressed as the following 
matrix equation: 

d.(/) = Qo(/)d.(/) + e2^. (22) 
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Solving (I22I ) for dj(/), we obtain 

di(/) = (l2iV - Qo(/))~'e27v, 

assuming that the inverse exists, where l2Ar is the {2N x 2A^) identity matrix. Since the long-run frequency 
of each state is given by the stationary distribution v(/), the average delay of user i can be expressed as 

D(/) = v(/)d,(/)-0.5. 

Also, the average inter-packet time is given by the first element of dj(/), i.e., = di{s*; /). 

Appendix B 

Derivation of Throughput and Average Delay Under Symmetric Protocols With 
1-Slot Memory and Memoryless Protocols in the WLAN Model 

Let / be a symmetric protocol with 1-slot memory. The long-run fractions of idle, success, and collision 
slots are given by 

Po{f) = v{{W,0)-f), (23) 
Pi{f) = v{{T,iy,f)+v{{W,iyj), (24) 
P2{f) = l-Po{f)-Piif). 
Using (flSl) . total throughput under / in the WLAN model can be written as 

r( f) = Plif )E[P] .... 

Po{f)cTo + Pi{f)<ri + P2{f)a2- ^ ^ 

In the WLAN model, we define the average delay of a user as the average waiting time (measured 
in a time unit) until the beginning of its next successful transmission starting from an arbitrarily chosen 
time. We define di{s; f) as the average waiting time of user i starting from the beginning of a slot whose 
outcome yields state s to user i. Let (t{s) be the duration of a slot yielding state s, i.e., cr(WiO) = gq, 
a{a, 1) = o"! for a = T, W, and a{a, k) = 02 for a = T,W and k > 2. Then (|2TI ) can be modified as 

d,{s; f) = a{s) + Q{s'\s;m{s';f). (26) 

s'G5\{s-} 

Let b be the column vector of the durations of slots, i.e., b = {a{T, 1) o-(r,2) • • • a(W,N - 1))^. 
Then (l26l ) can be written as a matrix equation 

di(/) = Qo(/)di(/) + b. (27) 
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Solving (1271 ) for dj(/), we obtain 

d^{f) = ihN - Qo{f))-'h. 

The probability that an arbitrarily chosen time belongs to a slot yielding state s is given by 

Note that a user stays in the initial slot for the half of its duration on average. Let y(/) be the row vector 
consisting of y{s; /). Then the average delay of user i under protocol / can be computed by 

Z?(/)=y(/)[d,(/)-0.5b]. 

Now let / be a symmetric memoryless protocol, which is simply a single transmission probability. 
Then ^ and (ED) can be expressed as Po(/) = (1-/)^ and Pi{f) = Nf{l-f)^~^, respectively, and 
we can use (1251 ) to compute total throughput. When users follow a memoryless protocol /, the average 
waiting time starting from the next slot, di{s; f) — cr{s), is independent of s, and thus we write the value 
as di{f). Manipulating ( [261 ) yields 

N[Po{f)ao + P2{f)a2] + {N - 

The average waiting time until the next slot is given by 

r{f) = iy(/)b - + + 



2-'''' 2[Po(/)cTo + Pl(/Vi + P2(/V2] 

Hence, average delay under memoryless protocol / can be computed by 

D{f)=r{f) + d,{f). 
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